De novo assembly and genomic structural variation analysis with genome sequencer FLX 3K long-tag paired end reads.

نویسندگان

  • Thomas Jarvie
  • Timothy Harkins
چکیده

The Genome Sequencer FLX System from Roche and 454 Life SciencesTM is a versatile sequencing platform suitable for a wide range of applications, including de novo sequencing and assembly of genomic DNA, transcriptome sequencing, metagenomics analysis, and amplicon sequencing. The Genome Sequencer FLX enables long sequence reads separated by kilobase distances of genomic DNA. These Long-Tag Paired End reads enable improved de novo assemblies and genomic structural variation studies. 454 Life Sciences has developed and commercially released a new protocol for generating a library of paired-end fragments to determine the orientation and relative positions of contigs produced by de novo shotgun sequencing and assembly. This 3K Long-Tag Paired End protocol (Figure 1) can also be used to identify genomic structural variations (1) and their associated breakpoints. Structural variation of the genome, involving large, kiloto mega-base-sized deletions, duplications, insertions, inversions, and complex combinations of rearrangements, is widespread in humans and is presumably responsible for a considerable amount of phenotypic variation. The 3K Long-Tag Paired End library DNA fragments contain an approximately 250-bp fragment with a 44-mer adaptor sequence in the middle, flanked by 100-mer sequences, on average. The two flanking 100-bp sequences are segments of DNA that were originally located approximately 3 kb apart in the genome of interest. In addition to the 3-kb paired-end protocol, initial results from an unreleased protocol that generates flanking reads separated by 16 kb are presented (Figure 2). The 16-kb protocol utilizes a different chemistry than the 3-kb protocol described here. Traditional approaches to the sequencing of paired-end reads rely upon inserting a DNA fragment into a vector, such as a BAC or fosmid, cloning into bacteria, and subsequently generating two sequences, one from each end of the vector. These methods entail weeks of laboratory work and could cost several hundred thousand dollars to prepare the libraries needed for Sanger sequencing. The Genome Sequencer FLX method presented here, which requires no cloning, generates up to 200,000 paired-end reads from a single Genome Sequencer FLX instrument run with a total elapsed time — from genomic DNA to result — of less than four days. SAMPLE PREPARATION PROTOCOL

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comprehensive transcriptome analysis with the Genome Sequencer FLX System

Protein isoforms make the transcriptome complex and challenging to resolve. Different forms of a protein may be produced from related genes or may arise from the same gene by alternative splicing. With a combination of long (400–500 bp) 454 SequencingTM reads, dedicated GS De Novo Assembler software and a straightforward protocol using just 200 ng of RNA as sample input, the Genome Sequencer FL...

متن کامل

de novo assembly and population genomic survey of natural yeast isolates with the Oxford Nanopore MinION sequencer

BACKGROUND Oxford Nanopore Technologies Ltd (Oxford, UK) have recently commercialized MinION, a small single-molecule nanopore sequencer, that offers the possibility of sequencing long DNA fragments from small genomes in a matter of seconds. The Oxford Nanopore technology is truly disruptive; it has the potential to revolutionize genomic applications due to its portability, low cost, and ease o...

متن کامل

Oxford Nanopore MinION Sequencing and Genome Assembly

The revolution of genome sequencing is continuing after the successful second-generation sequencing (SGS) technology. The third-generation sequencing (TGS) technology, led by Pacific Biosciences (PacBio), is progressing rapidly, moving from a technology once only capable of providing data for small genome analysis, or for performing targeted screening, to one that promises high quality de novo ...

متن کامل

De novo finished 2.8 Mbp Staphylococcus aureus genome assembly from 100 bp short and long range paired-end reads

MOTIVATION Paired-end sequencing allows circumventing the shortness of the reads produced by second generation sequencers and is essential for de novo assembly of genomes. However, obtaining a finished genome from short reads is still an open challenge. We present an algorithm that exploits the pairing information issued from inserts of potentially any length. The method determines paths throug...

متن کامل

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the next-generation DNA sequencer (NGS) Roche/454 and Illumina/Solexa systems, along with bioinformation analysis technologies of whole-geno...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • BioTechniques

دوره 44 6  شماره 

صفحات  -

تاریخ انتشار 2008